On The Marriage of Lp-norms and Edit Distance

نویسندگان

  • Lei Chen
  • Raymond T. Ng
چکیده

Existing studies on time series are based on two categories of distance functions. The first category consists of the Lp-norms. They are metric distance functions but cannot support local time shifting. The second category consists of distance functions which are capable of handling local time shifting but are nonmetric. The first contribution of this paper is the proposal of a new distance function, which we call ERP (“Edit distance with Real Penalty”). Representing a marriage of L1norm and the edit distance, ERP can support local time shifting, and is a metric. The second contribution of the paper is the development of pruning strategies for large time series databases. Given that ERP is a metric, one way to prune is to apply the triangle inequality. Another way to prune is to develop a lower bound on the ERP distance. We propose such a lower bound, which has the nice computational property that it can be efficiently indexed with a standard B+tree. Moreover, we show that these two ways of pruning can be used simultaneously for ERP distances. Specifically, the false positives obtained from the B+-tree can be further minimized by applying the triangle inequality. Based on extensive experimentation with existing benchmarks and techniques, we show that this combination delivers superb pruning power and search time performance, and dominates all existing strategies. Permission to copy without fee all or part of this material is granted provided that the copies are not made or distributed for direct commercial advantage, the VLDB copyright notice and the title of the publication and its date appear, and notice is given that copying is by permission of the Very Large Data Base Endowment. To copy otherwise, or to republish, requires a fee and/or special permission from the Endowment. Proceedings of the 30th VLDB Conference, Toronto, Canada, 2004

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

LP problems constrained with D-FRIs

In this paper, optimization of a linear objective function with fuzzy relational inequality constraints is investigated where the feasible region is formed as the intersection of two inequality fuzzy systems and Dombi family of t-norms is considered as fuzzy composition. Dombi family of t-norms includes a parametric family of continuous strict t-norms, whose members are increasing functions of ...

متن کامل

Studying the cohabitation subculture in metropolis Tehran (backgrounds and consequents)

Expended Abstract Introduction: In recent years, new methods of relationship between both sexes have emerged which have no family framework. Today, some girls and boys are living together although they have not got married legally and formally; based on a mutual agreement for an unspecified time. Some scholars express that Iran’s society has faced with increasing waves of changes in values an...

متن کامل

Studying the cohabitation subculture in metropolis Tehran (backgrounds and consequents)

Expended Abstract Introduction: In recent years, new methods of relationship between both sexes have emerged which have no family framework. Today, some girls and boys are living together although they have not got married legally and formally; based on a mutual agreement for an unspecified time. Some scholars express that Iran’s society has faced with increasing waves of changes in values an...

متن کامل

A Study on Social Structure, Marriage and a Romantic Relation

For marriage, we need at least a positive tendency but a forced or expedient marriage leads to divorce. Living in a society full of loving and respecting each other is necessary and it is an outcome of an organized family. This paper tries to analyze the effect of social structure on establishment of sustainable relation and emotional relationship. We can examine this subject from two point ...

متن کامل

Fast Time Sequence Indexing for Arbitrary Lp Norms

Fast indexing in time sequence databases for similarity searching has attracted a lot of research recently. Most of the proposals, however, typically centered around the Euclidean distance and its derivatives. We examine the problem of multimodal similarity search in which users can choose the best one from multiple similarity models for their needs. In this paper, we present a novel and fast i...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004